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Abstract: cDNA fragments covering the whole envelope (E) protein region of hepatitis C 
virus (HCV) have been molecularly cloned by using polymerase chain reaction (PCR) techni- 
que from the serum of five HCV infected patients in Nagasaki Prefecture. Their nucleotide 
and encoded amino acid sequences were determined and compared with other HCV cDNA 
clones isolated in Japan and the United States. The result showed that HCV cDNA isolates 
compared in this study were roughly divided into two subgroups; Japanese subgroup and 
USA subgroup. All HCV cDNA isolates in Nagasaki Prefecture were included in the former 
subgroup. Amino acid homology analysis showed that conformation of the E protein was 
relatively conserved among all compared HCV cDNA isolates, indicating that the E protein 


may be under structural or functional constraint. 
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INTRODUCTION 


Blood-borne non-A, non-B hepatitis (NANBH), which is caused by viral infection, is a 
major form of post-transfusion hepatitis (Feinstone, 1975) and frequently develops into 
chronic hepatitis and malignant liver diseases such as cirrhosis and hepatocellular carcinoma 
(Dienstag, 1983; Kiyosawa et al., 1984, Koretz et al., 1980; Realdi et al., 1982; Resnick ei al., 
1983). Recently, cDNA clones of probable hepatitis C virus (HCV) have been obtained by im- 
munoscreening of cDNA library derived from the plasma of a chimpanzee chronically in- 
fected with HCV in the United States (Choo et al., 1989). Using these clones, they subse- 
quently developed an immunoassay to detect specific anti-HCV antibodies (anti-C100) (Kuo et 
al., 1989). Seroepidemiological studies based on the detection of anti-C100 indicated that 
HCV not only is the major cause of post transfusion NANBH but also may have a pathogenic 
role in many cases of chronic liver disease (Alter et al., 1989; Kuo et al., 1989; Miyamura et 
al., 1990; Plageman, 1991; Saito et al., 1990). More recently, several research groups in 


DEENEN 
Received for Publication April 4, 1992. 
Contribution No. 2709 from the Institute of Tropical Medicine, Nagasaki University. 


40 


Japan have isolated cDNAs spanning partial or total length of HCV genome directly from 
human patients by using probes based on the sequence of the prototype isolates in the 
United States (Kato e£ al., 1990; Okamoto et al., 1990; Takamizawa et al., 1991; Takeuchi e£ 
al., 1990). From their analyses on primary structure of the viral genome, the HCV genome is 
considered to be a positive-stranded RNA molecule of about 10,000 nucleotides which en- 
codes a large polyprotein precursor of about 3,000 amino acids. A search for sequence 
homology of the HCV genome indicated that this virus is related to flaviviruses and 
pestiviruses (Houghton e£ al., 1988; Miller and Purcell, 1990). The gene products of HCV 
have not been identified, but HCV genome organization was speculated as shown in Figure 1 
(Takamizawa ef al., 1991), by comparing deduced amino acid sequence of the long open 
reading frame (ORF), its hydrophobicity profile, and the location of potential glycosylation 
sites with those of pesti- and flavivirus genomes. The HCV viral genome encodes one large 
polyprotein that is subsequently cleaved into three structural proteins, the core (C) protein, 
the matrix (M) protein, the envelope (E) protein, and six nonstructural proteins, NS1, NS2, 
NS3, NS4a, NS4b, and NS5. It is known that the epitopes which are recognized by the 
neutralizing antibodies exist on the E protein region in the case of flaviviruses. Therefore, 
the E protein region would become the first target for vacccine development of HCV infec- 
tion. In this study, by using polymerase chain reaction (PCR), a technique for DNA amplifica- 
tion in vitro (Saiki et al., 1988), a cDNA fragment of HCV genome which encodes the total E 
protein region have been molecularly cloned from the serum samples of 5 patinets with acute 
and chronic HCV infection in Nagasaki Prefecture. Their nucleotide and deduced amino acid 
sequences were comparatively analyzed with those of the HCV cDNAs isolated so far in 
other parts of Japan and in the United States in order to analyze the molecular epidemiology 
of local strains. These informations may also be useful for the strategies to develop the HCV 


vaccine. 
MATERIALS AND METHODS 


Preparation of RNA from human serum samples; Serum samples obtained from 
NANBH infected patients in Nagasaki Prefecture which were kindly provided by the Second 
Department of Internal Medicine of Nagasaki University Hospital were used. The serum 
samples were first screened the presence of anti-C100 by using Ortho Diagnostic enzyme-link- 
ed immunosorbent assay kit according to the manufacturer’s instruction. In the cases of 
chronic NANBH infection, anti-C100 positive serum samples were selected and used for 
RNA extraction. In the cases of acute NANBH infection, the serum samples were used for 
RNA extraction regardless of the presence of anti-C100 because it is known that the detec- 
tion of anti-C100 during acute phase of HCV infection (especially within one month since 
HCV infecton) is usually difficult (Alter e£ al., 1989; Kuo et al., 1989). To extract RNA, 100 
ul serum was made up to 200 yl with sterile distilled water and mixed with an equal volume 
of buffer containing 0.2 M "tris" — HCl (pH 7.5), 25 mM ethylenediaminetetra-acetic acid (ED- 
TA), 0.3 M NaCl, 2% sodium dodecyl sulfate (SDS), and 200 ug/ml proteinase K, and in- 
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cubated at 37°C for 40 min. Proteins were removed by two extractions with phenol and once 
with chloroform. Twenty micrograms glycogen was added to the aqueous phase and the 
RNA was precipitated by addition of three volumes of ice-cold absolute ethanol. After 
storage at —70 for 1 hr, the RNA was pelleted in an Eppendolf centrifuge (30 min, 
15,000 xg, 4). The pellet was washed once in 70% ethanol, vacuum dried, and dissolved in 
50 yl sterile distilled water. RNA solution was stored at —70 C. 

Primers for PCR reaction: The oligonucleotide primers were synthesized with an Ap- 
plied Biosystems DNA synthesizer (model 380B) and confirmed for purity by ion-exchange 
gel chromatography (Gen-pack; Waters) The sequences of primers were 5'-GTAATTTGG- 
GTAAGGTCATCGA' TAC-3' (sense) and 5’-TGCCAACTCCCATTGGTGTT-3 (antisense). This 
primer pair was expected to amplify 868 base pair DNA from putative region covering the 
whole E protein (nucleotide position: 707-1574) according to the sequences reported by 
Takamizawa et al. (1991) (Fig. 1). 

Reverse-transcription (RT)-PCR: Five microliters of RNA solution was added to 95 yl 
of RT-PCR mixture containing 100 pmol each primers, 0.2 mM each deoxynucleoside 
triphosphate, 10 mM Tris (pH 8.9), 1.5 mM MgCl, 80 mM KCl, 0.5 mg of bovine serum 
albumin per ml, 0.1% sodium cholate, 0.1% Triton X-100, 10 U of reverse transcriptase (Life 
Science Inc.), and 2 U of Tth DNA polymerase, a thermostable DNA polymerase (Toyobo 
Co.). The reaction mixture was covered by 2 drops of mineral oil, and the tube was in- 
cubated for 10 min at 53° for RT. PCR amplification (92°C for 60 sec, 53°C for 90 sec, and 
72°C for 120 sec by thermal cycler; Iwaki Co.) was started immediately after the RT and 
repeated 35 times. PCR product was subjected to 3% agarose gel electrophoresis, and 
amplified DNA fragments were visualized by ethidium bromide staining. 

Cloning and sequencing of HCV cDNA: The cDNA fragment amplified by RT-PCR 
was excised from the agarose gel, phosphorylated with T4 polynucleotide kinase (Nippon 
Gene Co.) and blunted with T4 DNA polymerase (Takara Co.). The modified DNA fragment 
was subcloned into the Smal site of pUC19. The nucleotide sequence of the isolated clone 
was determined by the dideoxy chain termination method (Sanger ef al., 1977). 
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Fig. l. Tentative genome organization of HCV according to Takamizawa et al. (1991) and 
mapping of cDNA clone. Nucleotides are numbered from the extreme 5’ end as in- 
dicated at the top. A proposed genetic organization of the HCV genome is indicated, 
with coding regions for the C, M and E protein as well as nonstructural proteins (NS1 
to NS5) shown as block, the 5' as well as 3' non-coding regions as bars. A thin box 
below HCV genome represent cDNA clones from PCR products which has been 
amplified from the serum of the HCV infected patients in Nagasaki Prefecture. 
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Comparison of nucleotide and amino acid sequences: The amino acid sequence was 
deduced from the nucleotide sequence of the cloned HCV cDNA, and the sequence of the E 
protein region was tentatively assigned by analysis of the putative cleavage sites and the 
hydrophobicity profiles of the encoded proteins according to the method reported by 
Takamizawa et al. (1991). The nucleotide and amino acid sequences of the E protein region 
of HCV cDNAs isolated in Nagasaki Prefecture and others were compared. These com- 
parisons were carried out with computer analysis by using DNASIS-Mac Version 1.0 soft- 
ware system (Hitachi Co., 1991). 


J RESULTS 


Isolation of the cDNA clones of HCV: by RT-PCR, target DNA fragments could be 
amplified from five patients’ serum samples and were isolated as five distinct cDNA clones 
of HCV in Nagasaki Prefecture, designated as HCV-J]N1-— HCV-JN5. Brief clinical profile of 
each patient is showed in Table 1. Two of them were acute hepatitis with history of blood- 
borne infection (HCV-JN1 and HCV-JN5). Remaining three were chronic hepatic diseases 
without apparent history of blood transfusion, therefore their routs of infection were 
unknown. 

Nucleotide sequences of the E gene of isolated HCV cDNAs: Figure 2 showed the 
nucleotide sequence of the E gene of HCV-JN1, an isolate in Nagasaki Prefecture, as well as 
the nucleotides that differ from this sequence for the four isolates in Nagasaki Prefecture 
(HCV-JN2~JN5), the five isolates in other parts of Japan; HCV-JYK (Takeuchi e£ al., 


Table 1. Clinical profiles of the patients from whose sera HCVs in Nagasaki Prefecture have 
been cloned by PCR 


Pt No. Date of serum Clinical Erud of ALT level at | Anti C—100 
(virus isolate) sampling diagnosis 


00 


transfusion sampling reactivity 


The abbreviations used in clinical diagnosis represents AH; acute hepatitis, PTAH; post-transfu- 
sion acute hepatitis, CH; chronic hepatitis, HCC; hepatocellular carcinoma, LC; liver cirrhosis respec- 
tively. 

*a: HCV has been transmitted from the blood transfused during gingival cancer operation. 
*b: This patient is a clinical doctor and HCV has been transmitted from a post-transfusion acute 
hepatitis patient through the syringe needle. 


LO 20 A0 40 s0 
HCV-JNl L TATGAAGTGC GCAACGTCTC CGGGATATAC CATGTCACGA ACGACTGCTC 50 
DCH -.IN2 green ues Gis Ee E VE SES 50 
HCV-JN3 L ---------- ------- Dro ---------- vemu TD e ne see eee ne S0 
HCV -.JN4 b -euesebi bns ee jo c QA FREE 50 
HCY -JNS L ---------- ------- Q-- ------ Gere ---------22 e ee S0 
HCY -IYK L ---------- --.----- G-- -O--------2. mem Fo C e e E 50 
HCV -JOC L --C--G---- ---+-+- Geer, Weieren SA 50 
HCV -JBK L --C------- A------ G-- --------- E S 50 
HCV-JJ1 i --CC------ ----- TC-A- A---C-T--T --------C- -T--T---C- SQ 
HCV -JJ4 Eden, unc es pn RHENO SQ 
HCV-US L --CC------ ++--- TC-A- G---C-T--- --C-----C- -T--T---C- SQ 
60 70 80 90 100 
HCV-JNl 51 CAACTCAAGC ATTGTGTATG AGGCAGTGGA CATGATCATC GATAGTCCOG 100 
HCV-JN2 S51 ---------- ---------- ---- G-C-«-- -G------- (e E au CREDE 190 
HCV-JN3 5l ----------.-- Q------- 2----- C--- --- A-----G C---QC---- 100 
HCV-JN4 JL eesenHeSS Ze sasan C--- -T----T--G C---QC---- 100 
HCV-JNS SI ---------- ---------- -.--- G-C--- -G------ G aee MM 100 
HCV-JYK 3l ---—------ ---------- ---- G-C--- -G------ -G C--GCC---- 100 
HCV-JGC gb ees quesos ei. ncm C--- --2---2--- -G C-C-CC---- 100 
HCV-JBK Sl. 4---Q--e4— Lernen e sur C--- -T------ -G C---C---T- 100 
HCV-JJl $1 T-----G--T -------- C- ----GCAC-- TGCC---C-G C---C---G- 100 
HCV-JJ4 Sl ---------- ---------- ------ Q--- -------- -G xoc asess 100 
HCV-US 91 T-----G--T -------- C- ----G-CC-- TGCC---C-G C-C-C---G- 100 
110 120 . 130 140 150 
HCV-JNI 101 GGTGCGTGCC CTGCGTTCGG GAGAATAACC TCTCCCGTTG CTGGGTAGCC 150 
HCV-JN2 101 ---------- --- T------ -- --C---T Q------.-- .22.2l122-- A 150 
HCV-JN3 101 ---------- ---------- ------2--. T C--------- qoc cuui A 150 
HCV -JN4 101 ---------- T-----Q--- ---- -Q-.--T E 22.22-22.22 -G 150 
HCV-JNS 101 ---------- --- T-----—- -- -TG-C---T Q------.--- nn -G 150 
HCV-JYK 101 ---------- ---------- ---- -C--TT C+-------- .-22---2-- -G 150 
HCV-JGC 191 ---------- ----- -C--- --- -G---TT -------.--.. .ll2.-.2-- 3G 150 
HCV -JBK 101 ---------- ----------. +s AGGC---T C----- -Q- --2---2.2-- «s 150 
HCV-JJ1 LOL ----T--C-- T------- -C ---GGC---G ----GA-G-- T-----G--G 150 
HCV -JJA LOL ---------- ---------- -- -G-C---A G--------- .....--. NT 150 
HCV -US 101 ------- CQ-- T-------- T ---GGC---Gd C---GA-G-- T-----G--G 1S0 
160 170 180 190 200 
HCV-JN1 151 CTCACTCCCA CGCTCGCGGC CAGGAACCCC MGCGTCCCCA CCACGACAAT 200 
HCV-JN2 151 ---------- ----------. --- -G-A-T- CAGCAT---- - T-------- 200 
HCV-JN3 151 ---------- ---------- ------ TTT- ---------- -.T-------- 200 
HCV-JN4À 151 -----C---- -----..---- .----- TT-- +-- — = Duoasacas 200 
HCV-JNS 1581 ---------- ---------.- ..2--.-- T-- -.- ASRaen a eher 200 
HCV-JYK 151 ---------- ----------. ------ TG-- ------.2--. - Damus T- 200 
HCV-JGC 151 ---------- ---------. -.------ AG- EG 200 
HCV-JBK 1581 ---------- ------- A-- ------ EE Aud Amd ise 200 
HCV-JJ1 151 A-G--C---- --G-A--CA- ----G--GG- -AAC-----G -G---CAGT- 200 
HCV-JIJ4 181 ---------- ---------- .----- TG-- --.-------- - Pisnika 200 
HCV-US 151 A-G--C--T- --G-G--CA- ----G-TOG- -AAC-----G -G---CAGC- 200 
210 220 230 240 250 
HCH TNL 201 ACGACGCCAC GTCGACTTIGC TCGTTGGGGC GGCTGCTTIC TGTTCCGCTA 250 
HCV-AIN2 201 ----GTA--- ---------- -.-2---.--.-2 ei E 22222-22222 250 
HCV~JN3 201 ------ T--- ---------- ---------. -.---.2----- s asa cns 250 
HCV-JN4 201 ---------- ----- Dann ---------- d se crime me (auo 250 
HCV -JNS 201 --------- bbb 250 
HCV-JYK 201 ----.------ ---------- --2-.----- Ax eet eer E ama pns 250 
HCV -JGC 201 ---------- ----- T---- ---------- -.--.--- S 250 
HCV-JBK 201 ---------- ---—-- TC--- ------—---- eee ewe ee fee een 250 
HCV-JJ1 201 T----- T--- A----TC--- -T--C---M3 C--CA-CC-- ----. -G--cc 250 
HCV-JJ4 QOL ---------- ---------- ---2------- ---------- -C------- 250 
HCV-US 201 T----- T--- A----TC--- -T--C---AG C--CA-CC-- ---- -G--cc 250 
260 270 280 290 300 
HCV-JN1 251 TGTACGTGGG GGATCTTTGC GGATCCGTTY TCCTTGTCTC CCAGCTGTTC 300 
HCV -JN2 251 ----T----- --- CQ--C--- ----- T---- ---- CA--C- meneen 100 
HCV -JN3 25l ---------- ------ C--- ----- T---- ---~ C----- -£--------- 300 
HCV-JN4 251 ---------- --- C--C--~ ----- T---- ------2--2-.- l2. A idu 300 
HCV-JNS 251 ----T----- -- -C--QC--- ----- T---- ---- m — 300 
HCV -JYK 251 ---------- ------ C--- ----- T---- ---- C----- ---------- 300 
HCV/-JOC 251 ------- T-- ------ Q--- -----+---- - T--C----- ---------- 300 
HCV-JBK 251 ---------- --- Cost em MESES T---- ---- C----- T--------- 300 
HCV-JJ1 251 -C-----7-77 onse -G--- --G-----C- -----A-TGG T--A----- T 300 
HCV-JJ4 251 ---------- -.---.- C--- ----- T---- -.-- C--2-- -----.---- 300 
HCV-US 251 -C-------- ---C--A--- --G--T--C- -T-----Cgg ---A------ 300 
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Fig. 2. Nucleotide sequences of the E genes of eleven HCV isolates, HCV-JNI~JN5, HCV- 
JYK, HCV-JGC, HCV-JBK, HCV-JJ1, JJ2, and HCV-US (nucleotides were numbered 
from the extreme 5’ end of the E gene). Only the changes from HCV-JN1 sequences 
are indicated for the other isolates. The identical nucleotides are given as bars. 
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HC'"7-JNI 
HCH -JN2 
HCH -JN3 
HCH -JNÀ 
HC'/-JN5 
HCH - INK 
HCV -JCC 
HCV -JBK 
HCH LI) 
HCH -JJA 
HC'7-US 


HCH INL 
HC'/-JN2 
HCH IN) 
HCV-JN4 
HCV -JN5 
HCV-JYK 
HCH -JGC 
HCV-JBK 
HCV-JJl 
HCV-JJ4 
HCV-US 


HCV-JN1 
HCH -JN2 
HCH -JN3 
HCV-JN4 
HCV-JNS 
HCV -JYK 
HCV -JGC 
HCV-JBK 
HCV-JJ1 
HCV-JJ4 
HCV-US 


HCV-JN1 
HCV -JN2 
HCV -JNJ 
HCH JN4 
HCV -JNS 
HCV -JYX 
HCV -JGC 
HCV -JBK 
HCV IL 
HCV -JJ4 
HCV-US 


HCV-JN1 
HCV-JN2 
HCH JN) 
HCV -JNÀ 
HCV-JNS 
HCV-JYK 
HCV-JGC 
HCV-JBK 
HCH TL 
HCV -JJ4 
HCV-US 


HCH ML 
HCV-JN2 
HCV-JN3 
HCV-JN4 
HCH NS 
HCV -JYK 
HCH -JGC 
HCH -JBK 
HCV -JJ1 
HCH IA 
HCV -US 


310 320 330 340 350 

JOL ACCTTCTCGC CTCGCCCGTA TGAGACAGTG CAGGACTGCA ATTGCTCACT 
JOL nme Acum een (Moe EOS X. ieu areis, eer GA- 
JOL ---------- -------- Qe ees (jM eme ege À- 
JUL eem cm pese. ee GT-- -£2--------- ---- T---À- 
JOL =-------- A- -G-A----C- --7---7---- A sees Seo) que GA- 
JUl. «mc ovem Que eee are SS M cpu e < Queso A- 
301 -------- A- --4-----2-2 screen G--A --A--T---- -------- A- 
dE Eeer: qe e Qe e peeesD-A emer Ts; eer A- 
JOL Saens T- -CA-G--CC- CTG----AC- --A-G----- ----T--TA- 
JUI seesse ee ee (e vem RENE Sse ear 5 eese A- 
JOD. -------- T- -CA-G--CC- CTG---GAC- --A-GT---- ------- TA- 
360 370 180 390 400 

391 CTATCCCGGC CATGTAACAG GTCATCGCAT GGCTTGGGAT  ATGATGATGA 
JJL reese Eeer qose qoe Ser Sah eine tesque 
JSL --— quse. es QgemTewe “4 Eet: "EE ueneno 
JSL ee, SEN Eeer Ee ere Ee G Ee 
191 eee TCG lores eue que ere cue pee 
dl jesse vem e (ecu mese. nue a peer. ema: 
JOl meom Eee ee Qus ees ences lxx Halen. eet teg 
351 ---------- ---- -GT-G- eigent ---------C seer eee 
35} ---C------ --- E tege ue A------ -------2--- 
JSL —-——————- T TaeTseem creme (QR. gie ue ele, demde s 
9b mosse wer A----G- ----C----- --- A------ Ó--------- 
410 420 430 440 450 

401 ACTGGICACC TACAGCAGCG CTAGIGGIGT CGCAATTACT CCGGATCCCA 
401 emos emm oC EEN, mene G--G-- ----7--7-7--- 
J401 -24 Gam ES AC ee, onum e EE 
Ir Reng Acme EE Imm ee Eeer 
401, eI (ee Cara oe ST G--G-- --------2-- 
401 ------ -G-- C--G-----C T--------- eer G----- --------2-- 
401 mmm omm teen, emer Ne: oem E ebe eege 
40i -------G-- C---A----C EE CC eges m meses soe 
401 ------- C-- ---G--G--- T-G--AA--G -T--GC-G-- ---------- 
401 ---------- ---- Ree imme ests -G--G-- ---2--2---- 
401 ------ -Q-- ---GA-G--- T-G--AA--G -T--GC-G-- ---------- 
460 470 480 490 500 

4S1 CAAGCCGTOG TGGACATGGT GGCTGGGGGC CACTGGGGGG TTCTOGCOGG 
45L assez Pas mee wem cH -G--A-C- --------À- -Q-------- 
451 ------ Auch -----2-2---7- -A-G---=2-C- S=s457"> A= Ce -CC 
45l ---- Tansa meme Tm SACSeeuU S menses c AA «C009 
451 ----- EE -G--A-C- --------À- -C--------7 
491 ----- qTessA accedo Soe (usce e eee m oe A- -C--AÀ----- 
ABl. demnm. LEE: cus mg marmo Ev mm qo excess 
45L em erue n Geeks eie mni À- -C-------- 
dl 5 A--T ----T---À- C----- VE emu A Eme 
45l ----- T---- -Ó------2--- om Gri Ce esses ns e cue 
ASL e ASCToRee Av Ct TOT emm A- oem 
510 520 530 540 950 

501 GCTCGCCTAC TATTCCATGG TGGGGAACTG GGCTAAGGTC TIGATTGTGA 
SOL C--T----74 EES EE LEE To semen eee 
501 Zeene, euet mm Sees A----- 0 --——--442 c--42022277 
SOL C--T--T--- ---------- treme rn c7 Qc T -As4—- 
SOL CQ--T------ ---------- ---7-0-2--7270 7007000077 (qoem me 
SOL CQ--T------ ---------- re 707700077 T ---------- 
50L CosTe-uceme meme. erue meum cemere scm mE IIT TU 
501 C--T------ ---------- e Esc dun E 
SOL CA-A--G--T -TC------- 772227 77 See C--G-A---C 
SU] ies eet EST 0T EE VE epe ees -G 
SOL CA-A--G--T -TC------- -777777277 77 SSES C--G-A---C 
560 570 $80 590 600 

551 TGTTACTCTT TGCCOGCGTT GACGGAAGCA CCGTCGTGAC AGGG...... 
$5] --C------- C---------2 -- T--GCAA- --CG------ G-a 
551 --C-G--T-- ---------- -77-7 -GGAT- --CG------ martes eee 
551 --Q----T-- ---------- --7- -GG-A- --CA----T- aas 
$51] --QC------- Q--------- -- T-COGCA- A-CG------ Goes 
$51 --C------- ---------- -77-7 -GCAT- --CG------ G---7... 
95] --C------- --- qeeeem utm -GCA-- --CA------ =t.e’ 
551 --C----T-- --- T------ -Ó--- -GGAT- --CA------ Ee 
$51 ----G--G-- --------- C ----QOGAA- --A----CT- C---...... 
551 C-Q------- -------7--- --7--7- GGAG- --TA-AC-T- G---.....- 
551 --C-G--A-- ---------C ----COCAA- -—CA---C-- Crew. .aee 


350 
350 
350 
350 
350 
350 
350 
350 
350 
150 
350 
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400 
400 
400 
400 
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1990), HCV-JGC (Kato et al., 1990), HCV-JBK (Takamizawa et al., 1991), HCV-JJ1 and HCV- 
JJ4 (Okamoto e£ al., 1990), and the prototype isolate in the United States; HCV-US (Choo e£ 
al., 1991). This figure showed that the nucleotide differences were scattered throughout the 
length of the gene, and there appeared to be no particular region of hypervariability. The 
numbers of nucleotide differences and the identity percent of the necleotidies between each 
of these isolates were summarized in Table 2 (a) and (b), respectively. These results in- 
dicated that the isolate in the United States (HCV-US) was less closely related to the isolates 
in Japan except for HCV-JJ1. Except HCV-JJ1, HCV-US showed an average of 25.5% 
nucleotide sequence difference with all Japanese HCV cDNA islates, and the difference 
among Japanese isolates was in the range from 4.2% to 12.1% (average 9.1%). HCV-JJ1, 
which was isolated from a HCV healthy carrier in Japan, was closely related to HCV-US 
showing 94.6% nucleotide sequence identity. Although pairs of other isolates in Japan show- 
ed similar nucleotide sequence identity each other, there was a pair of isolates which showed 
less nucleotide difference; HCV-JN2 and HCV-JN5, both of which were isolated in Nagasaki 
Prefecture, showed a 96.3% identity. 

Amino acid differences in the E protein of HCV isolates: The encoded amino acid se- 
quences of the E protein from HCV-JN1 isolate was shown, in the standard one letter code, 
in Figure 3, with different amino acids noted for the other isolates. Five potential N- 
glycosylation sites were found from the amino acid sequence of the E protein of HCV-JN1. 
Two of them were completely conserved in all other isolates. In the remaining three sites, 
amino acid sequences were different in some other isolates. But in two of these sites, the pat- 
tern of the amino acid sequence (N-X-T/S) were conserved in all other isolates. Various 
amino acid differences among HCV strains were scattered throughout the E protein, and 
there appeared to be no particular region of hypervariability. Table 2 (a) and (b) summarized 
the amino acid differences and identities percent between these isolates. These results in- 
dicated that HCV-US and HCV-JJ1 were less closely related to other Japanese isolates, 
similar to the nucleotide sequences comparison. The amino acid identity percent between 
each isolate were similar to those of the nucleotide identity indicating that most of the muta- 
tions were not silent. The difference between the amino acid sequence homology percent 
calculated by using the Dayhoff scoring matrix (Dayhoff and Schwartz, 1978) and the amino 
acid sequences identity percent were shown in Table 3. Homology among each isolates in- 
cluding also HCV-US and HCV-JJ1 showed high matching figure (average 95.9%), compared 
with those of the identity (average 87.1%) indicating that many of the mutations were not 
silent, but were neutral (conservative). Figure 4 showed the amino acid sequence of the E 
protein of HCV-JN1 isolate, as well as the amino acids that were not homologous with this se- 
quence for the other HCV isolates. There appeared to be three conserved regions (AA 
no.l~25, no.81~106, no.122~192). Figure 5 showed the relation of the hydrophobicity pro- 
file of the E protein of HCV-JN1 clone using Hopp and Woods’s program (1981) that is 
useful for prediction of protein antigenic determinants, and the locations of these three con- 


served regions. Several hydrophilic portions were observed on region A, but few on region B 
and C. | 
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(a) Nucleotide and predicted amino acid differences among eleven HCV isolates 


Table 2. 


in the gene coding for the envelope protein 
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Numbers above the diagonal represent amino acid differences among the HCV isolates, while those 


below the diagonal represent nucleotide differences. 


(b). Nucleotide and predicted amino acid identity % among eleven HCV isolates 


in the gene coding for the envelope protein 


AMINO ACID 


| SPST eT KL LIT EA 26 oe 
T e leien ei lui leo | zeit — 
<=! o. D * * . D LI D D * 
JERS SI SLSIS|SIS 1518 "E 
Sole ECH ER ER eS ER CHE a b: 
— 


CV= 


SÄEAEAEAEAEAEAEAEAE 
Ee E--A E NE NE. ME-2b— E ET 
Mian lol sl lien lol = 
Sek E pow pape SaSaD EE lr E 


94 
73 
94.6 


CV— 


96 
9] 
75.1 


ae [ae RL] aS [39 | 88 | o8 ae [ae E 
SS Eas de Rene oc el o Na Se Vë) See 
gQ| :o |t [t |t | 99 | tz : e 
Exec Jg s ue gu ~ 
Q| a e e Nr 
OD sla] so] rei co o oD LO 
kal O0 | mE my, mM], my] c tC ti 
| RLS] RP ae | 8€ ae [ae [ae | xe] x 
c |i: |i, c | wo fo | MH 7 bi tu 
Ble | S| o ER le lei lei |= 
E S: Ox[9»] ES A E Ai ml mw] nani re 
| |X] a8 [a8 | xX VIL |e] EA E 
Te oe A es SH e E 
SEAEAEAE $|8]8|8|S|E 
Fa 
EA EA E SSES EA ESA ESA EA E 
x pee MT dee "ch Isi ji d In iech TI oU cO 
TAS S/S $|58|8|8|8|5€8 
Zi © | © > Je INIS JNJ] ce 
ES Sx] o e$ leie li olm~l] Ss 
| | & LV ss} epee [sag RLJ L| eS | x 
Z| os SECHER 
Sch D o |o ajoa |oo |o |t-| GG 
| aS [ae L x] RL LLJ ESA EA EA x 
om | CO |= | Oo BK er E Bann | a ] Oo] oe 
5|8 $8|8|g8|s|$|8 |s|& 
Fa 
D E EE D AE AKT qu qub A 
eile le ole sip > 
OF OSS OF OS ORO SOS OnlOos/0 
HL EE EH EE EE LH mH ES mH n HS mH 


AdILOITONN 


90 
91 


CV 
4 


HCV 
4 


CV 


HCV 


74.2 


e 


HCV — | HCV 
N1 
CV 
US ENI 


Numbers above the diagonal represent amino acid identity % among the HCV isolates, while those 


below the diagonal represent nucleotide identity %. 


HCV-JNI 
HCV-JN2 
HCV-JNJ 
HCV -JN4 
HCV-JN5 
HCV-JYK 
HCV-JGC 
HCV-JBK 
HCV-JJ1 
HCV-JJ4 
HCV-US 


HCV-JNÍ 
HCV-JN2 
HCV-JN3 
HCV-JN4 
HCV-JN5 
HCV-JYK 
HCV-JGC 
HCV-JBK 
HCV-JJl 
HCV-JJ4 
HCV-US 


HCV-JNI 
HCV-JN2 
HCH -JNJ 
HCV-JN4 
HCV-JNS 
HCV-JYK 
HCV -JGC 
HCV-JBK 
HCV-JJ1 
HCV-JJ4 
HCV-US 


HCV-JNI 
HCV-JN2 
ICH JN3 
!HCV-JN4 
HCV-JN5 
HCV-JYK 
HCV-JGC 
HCV-JBK 
DCH Al 
HCV-JJ4 
HCV-US 


Fig. 3. 


151 
151 


10 20 30 40 50 
YEVRNVSGLY HVINDCSNSS_ IVYEAVUMII DSPGCVPCVR ENNLSRCWVA 
-----2--- M- ---------- -----A-V-M MT-------- -D-S------ 
we beeen ee ----12----- 2---- A-I-M HT-------- ---8------ 
---4--24-- wee enn ene -..-- A-L-M HT-------- ---S------ 
------- M- ---------- -----A-V-M HT-------- -D-S------ 
------2---- eee nee -2--- A-V-M HA-------- ---8------ 
l| AA A---M HT-------- -S-F------ 
-—-H------ -------- N- ----- A-L-M HT-------- - G-S------ 
-Q---ST-L- ------P--- -----H-A-L HT-------- - G-V------ 
wane eee ee 22--2----- ----- A---M HT-------- -D-S------ 
-Q---ST-L- ------P--- -----A-A-L HT-------- - G-A------ 

. 60 70 80 90 100 
JFPTLAARNP SVPTPTIRRH VDLLVGAAAF CSAMYVGDLC GSVFLVSQLF 
-------- EL QH------V- ---------- -------+--- -----Ip--- 
SE, E gege Ae, Ze 
See SE c 
SE Me GE 
-=---=--- A ------ L--- ------ T--- ---------- ----- I---- 
erer S -I-------- ch ---------- ---------- 
BENE V Ti-------- ---2------2 222222222. l-222-2-2--- 
M---V-T-DG Kb-A-QL--- I----- S-TL ---L------ ----- IG--- 
d E d eL nec Meee ere Ic 
M---V-T-DG KL-A-QU--- I----- S-TL ---b------ ------ G--- 

110 120 130 140 150 
TFSPRRYETV ODGNCSLYPG HVIGHRMAWD MMMNWSPTAA LVVSQLLRIP 
—— H--- ------I--- YLS------- ---------- ---------- 
Seas H--- ------[--- -L------2-- --+---+--T -2--------- 
SE H--Lh ------I--- --8-Q----- --2--2.2-q- +--------- 
----H-H--- ------ I--- YLS------- ---------- ---------- 
------ H--- ------I--- --8------- ------~--- ---------- 
me SE Lee, ^ Wess l--- --BS------- Riu mul an lg em cU gm mui mde bm me US Me mide dum 
T HV-T ------XI--- --S------- --------T- ---------- 
—— HW-T -G----I--- -I-------- ---------- --MA------ 
------ H--- ------I--- -LS------- ---------- ---------- 
-2-i-2- HW-T -G----I--- -I-------- ---------- --MA------ 

160 170 180 190 200 
QAVVDMVAGG HWGVLAGLAY YSMVGNWAKV LIVMLLFAGV DGSTIVVIG.. 
-42M----- A ease --- A------ ---------- -- Q-R---.. 
--IM---'T-A ------ p--- ---------- ---------- -- D-R--- 
— q-A ---I------ 22-22-2222 222-22-2-2-- --G-lH-8- 
---M----- A ---------- 2-----.-2--. l-.------- -AANR- - - 
Um -——————M———— ES H-R--- 
—— eege EE, Sate Sect Me H-IH--- 
E Aces ee hei -—————— ze D-H--- 
ANUS S Sasa [xs Ee, & V-L------ -AE-I-S- 
—— A wna een nae 2-2222---22 22-2-22---- ~-E-YTS- 
--lb--I--A ------- I-- F--------- -V-L------ -AE-MH---.. 


4T 


200 
200 


Envelope protein sequences derived by the translation of the nucleotide sequences 
shown in Fig. 1 of eleven HCV isolates (amino acid were numbered from the extreme 
5' end of the E protein). Only the amino acids that differ from HCV-JN1 are indicated 
for the other isolates. The identical amino acids are given as bars. The five potential 
N-glycosylation sites (N-X-T/S) are underlined. 
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Table 3. Amino acid homology 96 and identity % among eleven HCV isolates 
of the envelope protein 


AMINO ACID HOMOLOGY 
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ta above the uod represent amino acid homology % among the HCV isolates, while 
those below the diagonal represent nucleotide identity % same as numbers above the diagonal in 
Table 2 (b). 


Evolutional relationships among the HCV isolates: According to the method to 
calculate evolutional difference (Ks) between two homologous nucleotide sequences 
reported by Miyata and Yasunaga (1980), Ks for each pair of the compared HCV cDNA 
isolated has been calculated by using “EVO-DNA4” computer program (Okumura and 
Izui, 1990) (the program soft has been provided by Dr. K. Izui, Department of Chemistry, 
Faculty of Science, Kyoto University). Based on the Ks values, phyrogenetic relatinships 
among compared HCV cDNA isolates were determined by the unweighted pair-group 
method (Sneath and Sokal, 1973) (Fig. 6). From this result, HCV cDNA isolates compared 
in this study were roughly divided into two groups; the first group contains HCV-US and 
HCV-]]1 and the second group contains the remaining isolates. | 
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Fig. 4. Envelope protein sequences of eleven HCV isolates as same as shown in Fig. 3. The 
amino acids that are not homologous to HCV-JN1 are indicated. The homologous 
amino acids are given as bars. The sequence regions in where amino acids homology 
is highly conserved among the all eleven isolates are boxed. 
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Hydrophobicity profile [HCV-JN1] 


: 4 Y n i j 1 db . "Zum "y d | 


amino acid No. 


Fig. 5. Hydrophilicity profile of E protein from HCV-JN1. The bold bars given as region A, 
B, C indicate the location of the highly conserved sequence regions shown as boxed 
regions in Fig. 4. 
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Fig. 6. Phyrogenetic tree illustrating the evolutional relationship of the eleven HCV isolates. 
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DISCUSSION 


The comparison of nucleotide and amino acid sequences among various HCV cDNA 
isolates and the phyrogenetic tree calculated from these results showed that there was no 
closer relationships among the isolates in Nagasaki Prefecture compared with those in other 
parts in Japan. The result indicated that the compared HCV cDNA isolates in Japan, except 
HCV-JJ1, were included in one large subtype (Japanese subtype), and a isolate in the United 
States (HCV-US) was distinct from this subtype. HCV-JJ1 was considered to belong to the 
HCV-US because of its sequence relatedness. The nucleotide and amino acid differences 
were scattered throughout the length of the E protein region, and no particular hypervariable 
or conserved region was observed. Although the amino acid identity between each pair of the 
compared isolates showed some variation (75.3%~97.5%, average 87.1%), the amino acid 
homology based on the Dayhoff scoring matrix was well conserved (91.4%~99.0%, average 
95.9%) among all isolates including US subtype strains. While seve ^] highly conserved 
region were observed in the E protein sequence, indicating that many of amino acid muta- 
tions in the E protein region were conservetive. This result suggests that conformation of the 
HCV E protein is relatively conserved, indicating that the E protein region may be under 
structural or functional constraint. From the comparison with hydrophobicity profile of the E 
protein, one of the conserved region (region A in Fig. 5) may possibly contain the antigenic 
domain which function as neutralizing epitopes. Recently, Matsuura et al. (1991) expressed 
the HCV E protein gene as a 35 kDa glycoprotein (gp35) by using a recombinant 
baculovirus. By Western blot analysis using expressed gp35 as antigen, they tried to detect 
antibodies to gp35 in the sera from NANBH patients. From this experiment, they got follow- 
ing observations. Antibodies to gp35 were detectable in only 4%~11% of NANBH patients, 
not only in acute hepatitis but also chronic hepatic diseases (chronic hepatitis, liver cirrhosis, 
and hepatocellular carcinoma). In two of the acute hepatitis and one of the chronic hepatitis 
cases, patients have been recovered from hepatitis after appearance of anti-gp35 antibodies. 
The serum alanin aminotransferase level has been generally kept at lower level in the anti- 
gp3o seropositive chronic disease patients. These results raise two possibilities: (i) gp35 
does not contain neutralizing epitopes or has very low immonogenicity, but has some func- 
tions to inhibit viral proliferation, (ii) although gp35 contains neutralizing epitopes, their an- 
tigenic structure is different among various HCV because of the hypervariability of amino 
acid sequences. From the results presented in this parer, the second possibility does not 
seem to be probable. On the other side, some researchers proposed that the hypervariable 
region which exists in the NS1 region carries potential neutralizing epitopes and suggested 
the mechanism resembling to the second possibility mentioned above that make lead to the 
persistent infection of HCV (Hijikata et al., 1991; Kremsdolf et al., 1991; Weiner et al., 1991). 

. At present, several speculations and suggestions are under discussion concerning this 
problem. To get more precise and useful information, further experiments on HCV and 
establishment of the cell culture system for HCV will be necessary. 
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